|
|
Registro Completo |
Biblioteca(s): |
Embrapa Gado de Leite. |
Data corrente: |
26/12/2018 |
Data da última atualização: |
24/01/2023 |
Tipo da produção científica: |
Artigo em Periódico Indexado |
Autoria: |
RIBEIRO, I. M.; BORGES, C. C. H.; SILVA, B. Z.; ARBEX, W. A. |
Afiliação: |
WAGNER ANTONIO ARBEX, CNPGL. |
Título: |
A genetic programming model for association studies to detect epistasis in low heritability data. |
Ano de publicação: |
2018 |
Fonte/Imprenta: |
Revista de Informática Teórica e Aplicada, v. 25, n. 2, p. 85-92, 2018. |
Idioma: |
Inglês |
Conteúdo: |
Abstract The genome-wide associations studies (GWAS) aims to identify the most influential markers in relation to the phenotype values. One of the substantial challenges is to find a non-linear mapping between genotype and phenotype, also known as epistasis, that usually becomes the process of searching and identifying functional SNPs more complex. Some diseases such as cervical cancer, leukemia and type 2 diabetes have low heritability. The heritability of the sample is directly related to the explanation defined by the genotype, so the lower the heritability the greater the influence of the environmental factors and the less the genotypic explanation. In this work, an algorithm capable of identifying epistatic associations at different levels of heritability is proposed. The developing model is a aplication of genetic programming with a specialized initialization for the initial population consisting of a random forest strategy. The initialization process aims to rank the most important SNPs increasing the probability of their insertion in the initial population of the genetic programming model. The expected behavior of the presented model for the obtainment of the causal markers intends to be robust in relation to the heritability level. The simulated experiments are case-control type with heritability level of 0.4, 0.3, 0.2 and 0.1 considering scenarios with 100 and 1000 markers. Our approach was compared with the GPAS software and a genetic programming algorithm without the initialization step. The results show that the use of an efficient population initialization method based on ranking strategy is very promising compared to other models. MenosAbstract The genome-wide associations studies (GWAS) aims to identify the most influential markers in relation to the phenotype values. One of the substantial challenges is to find a non-linear mapping between genotype and phenotype, also known as epistasis, that usually becomes the process of searching and identifying functional SNPs more complex. Some diseases such as cervical cancer, leukemia and type 2 diabetes have low heritability. The heritability of the sample is directly related to the explanation defined by the genotype, so the lower the heritability the greater the influence of the environmental factors and the less the genotypic explanation. In this work, an algorithm capable of identifying epistatic associations at different levels of heritability is proposed. The developing model is a aplication of genetic programming with a specialized initialization for the initial population consisting of a random forest strategy. The initialization process aims to rank the most important SNPs increasing the probability of their insertion in the initial population of the genetic programming model. The expected behavior of the presented model for the obtainment of the causal markers intends to be robust in relation to the heritability level. The simulated experiments are case-control type with heritability level of 0.4, 0.3, 0.2 and 0.1 considering scenarios with 100 and 1000 markers. Our approach was compared with the GPAS software and a genetic programming algorithm without... Mostrar Tudo |
Palavras-Chave: |
Computational Modeling; Genetic Programming; GWAS; Mathematical Modeling; Random Forest; SNP. |
Thesaurus Nal: |
Bioinformatics. |
Categoria do assunto: |
X Pesquisa, Tecnologia e Engenharia |
URL: |
https://ainfo.cnptia.embrapa.br/digital/bitstream/item/189322/1/Artigo-RevInfTeorApl.pdf
|
Marc: |
LEADER 02397naa a2200241 a 4500 001 2102526 005 2023-01-24 008 2018 bl uuuu u00u1 u #d 100 1 $aRIBEIRO, I. M. 245 $aA genetic programming model for association studies to detect epistasis in low heritability data.$h[electronic resource] 260 $c2018 520 $aAbstract The genome-wide associations studies (GWAS) aims to identify the most influential markers in relation to the phenotype values. One of the substantial challenges is to find a non-linear mapping between genotype and phenotype, also known as epistasis, that usually becomes the process of searching and identifying functional SNPs more complex. Some diseases such as cervical cancer, leukemia and type 2 diabetes have low heritability. The heritability of the sample is directly related to the explanation defined by the genotype, so the lower the heritability the greater the influence of the environmental factors and the less the genotypic explanation. In this work, an algorithm capable of identifying epistatic associations at different levels of heritability is proposed. The developing model is a aplication of genetic programming with a specialized initialization for the initial population consisting of a random forest strategy. The initialization process aims to rank the most important SNPs increasing the probability of their insertion in the initial population of the genetic programming model. The expected behavior of the presented model for the obtainment of the causal markers intends to be robust in relation to the heritability level. The simulated experiments are case-control type with heritability level of 0.4, 0.3, 0.2 and 0.1 considering scenarios with 100 and 1000 markers. Our approach was compared with the GPAS software and a genetic programming algorithm without the initialization step. The results show that the use of an efficient population initialization method based on ranking strategy is very promising compared to other models. 650 $aBioinformatics 653 $aComputational Modeling 653 $aGenetic Programming 653 $aGWAS 653 $aMathematical Modeling 653 $aRandom Forest 653 $aSNP 700 1 $aBORGES, C. C. H. 700 1 $aSILVA, B. Z. 700 1 $aARBEX, W. A. 773 $tRevista de Informática Teórica e Aplicada$gv. 25, n. 2, p. 85-92, 2018.
Download
Esconder MarcMostrar Marc Completo |
Registro original: |
Embrapa Gado de Leite (CNPGL) |
|
Biblioteca |
ID |
Origem |
Tipo/Formato |
Classificação |
Cutter |
Registro |
Volume |
Status |
URL |
Voltar
|
|
Registros recuperados : 14 | |
3. | | RIBEIRO, I. M.; BORGES, C. C. H.; ARBEX, W. A.; SILVA, B. Z. da. Programação genética com inicialização baseada em floresta randômica em estudos de associação do genooma completo. In: INTERNATIONAL SODEBRAS CONGRESS, 35., 2016, Foz do Iguaçu. Anais... Publicado na Revista Sodebras, v. 11, n. 129, set. 2016. 6 p.Tipo: Artigo em Anais de Congresso |
Biblioteca(s): Embrapa Gado de Leite. |
| |
8. | | ZONOVELLI, B.; BORGES, C. C. H.; ARBEX, W. A.; OLIVEIRA, F. C. de; RIBEIRO, I. M. Regressão com máquinas de vetores suporte e seleção de atributos via algoritmo genético aplicada em seleção genômica. Revista Sodebras, v. 11, n. 129, set. 2016. Edição do 35º International Sodebras Congress, Foz do Iguaçu, 2016.Tipo: Artigo em Anais de Congresso |
Biblioteca(s): Embrapa Gado de Leite. |
| |
9. | | OLIVEIRA, F. C. DE; ALMEIDA, F. N.; CAPRILES, P. V. S. Z.; ARBEX, W. A.; BORGES, C. C. H. Support vector machine e support vector regression para seleção de SNPs em GWAS. In: SIMPÓSIO DE MECÂNICA COMPUTACIONAL, 11.; ENCONTRO MINEIRO DE MODELAGEM COMPUTACIONAL, 2., 2014, Juiz de Fora. Resumos... Juiz de Fora: [s.n.], 2014. Tipo: Resumo em Anais de Congresso |
Biblioteca(s): Embrapa Gado de Leite. |
| |
10. | | MAGALDI, H.; BRAGA, R.; ARBEX, W. A.; CAMPOS, M. M.; BORGES, C. C. H.; DAVID, J. M. N.; CAMPOS, F. Uso de ontologias e técnicas de visualização no apoio às pesquisas em eficiência alimentar de gado de leite. In: CONGRESSO BRASILEIRO DE AGROINFORMÁTICA, 11., 2017, Campinas. Ciência de dados na era da agricultura digital: anais. Campinas: Editora da Unicamp: Embrapa Informática Agropecuária, 2017. p. 625-637.Tipo: Artigo em Anais de Congresso |
Biblioteca(s): Embrapa Gado de Leite. |
| |
11. | | MAGALDI, H.; BRAGA, R.; ARBEX, W. A.; CAMPOS, M. M.; BORGES, C. C. H.; DAVID, J. M. N.; CAMPOS, F.; STROELE, V. Análise de Eficiência Alimentar de Gado Leiteiro a partir da Integração de Bases Heterogêneas e Ontologias. In: BRAZILIAN E-SCIENCE WORKSHOP, 38., 2018, Natal-RN. Anais... [S.l.]: Sociedade Brasileira de Computação, 2018.Tipo: Artigo em Anais de Congresso |
Biblioteca(s): Embrapa Gado de Leite. |
| |
12. | | OLIVEIRA, F. C. de; ALMEIDA, F. N.; SILVA, F. F. e; SILVA, M. V. G. B.; BORGES, C. C. H.; ARBEX, W. A. Metodologia para seleção de marcadores com máquina de vetores de suporte com regressão. In: ARBEX, A.; MARTINS, N. F.; MARTINS, M. F. Talking about computing and genomics - TACG vol.1: Modelos e métodos computacionais em Bioinformática. Brasília, DF: Embrapa, 2014. p. 101-126Tipo: Capítulo em Livro Técnico-Científico |
Biblioteca(s): Embrapa Gado de Leite. |
| |
13. | | ARBEX, W. A.; SILVA, F. F. e; SILVA, M. V. G. B.; BORGES, C. C. H.; OLIVEIRA, F. C. de; VARONA, L.; VERNEQUE, R. da S. Decision Support in Attribute Selection with Machine Learning Approach. In: CONFERENCIA IBÉRICA DE SISTEMAS Y TECNOLOGÍAS DE INFORMACION, 9., 2014, Barcelona. Actas... Barcelona: Aisti; Salle, 2014. CISTI 2014Tipo: Artigo em Anais de Congresso |
Biblioteca(s): Embrapa Gado de Leite. |
| |
14. | | OLIVEIRA, F. C. de; BORGES, C. C. H.; ALMEIDA, F. N.; SILVA, F. F. e; VERNEQUE, R. da S.; SILVA, M. V. G. B.; ARBEX, W. A. SNPs selection using support vector regression and genetic algorithms in GWAS BMC Genomics, v. 15, article S4, 2014. 15 p. Suppl. 7.Tipo: Artigo em Periódico Indexado | Circulação/Nível: A - 1 |
Biblioteca(s): Embrapa Gado de Leite. |
| |
Registros recuperados : 14 | |
|
Nenhum registro encontrado para a expressão de busca informada. |
|
|